(navigation image)
Home Wayback Machine | Archive-It | Blog | Heritrix
Search: Advanced Search
Anonymous User (login or join us)
Upload

Download item

[item image]

Play / Download (help[help])


All Files: HTTPS

Resources

Bookmark

CuilCuil crawldata 20071221050441 to 20071221071938 (2007)

Crawl data captured by Cuil from Fri Dec 21 05:04:41 UTC 2007 to Fri Dec 21 07:19:38 UTC 2007


This item is part of the collection: Cuil Crawl Data

Mediatype: web
Identifier: cuil6-domainshard-corpus5-large-merge-rev1.19682-of-25000
Addeddate: 2013-01-25 20:45:00
Publicdate: 2013-02-02 08:28:15
Date: 2007
Firstfiledate: 20071221052004
Lastfiledate: 20071221052004
Scandate: 20071221052004
Bad_records: 12
Good_records: 1081916
Contributor: Cuil
Creator: Cuil
Identifier-access: http://archive.org/details/cuil6-domainshard-corpus5-large-merge-rev1.19682-of-25000
Numwarcs: 1
Sponsor: Cuil
Imagecount: 1081917
Keywords: crawldata


Individual Files

Information FormatSize
cuil6-domainshard-corpus5-large-merge-rev1.19682-of-25000_files.xml Metadata [file]
cuil6-domainshard-corpus5-large-merge-rev1.19682-of-25000_meta.xml Metadata 1.1 KB

Be the first to write a review
Downloaded 42 times
Reviews


Terms of Use (10 Mar 2001)